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Here, we report the genome sequence of a novel iflavirus strain recovered from the neotropical butterfly Heliconius erato. The 
coding DNA sequence (CDS) of the iflavirus genome was 8,895 nucleotides in length, encoding a polyprotein that was 
2,965 amino acids long. 
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Heliconius butterflies are a diverse group of around 40 neotro- 
pical species that are widely studied for their wing color 
pattern diversity and Mtillerian mimicry, and display a striking 
pattern of both divergent and convergent evolution in wing phe- 
notypes (1, 2). Heliconius erato is native to Central and South 
America where a huge diversity of wing phenotypes are expressed 
across races from different geographic locations (3). Iflaviruses are 
single-stranded RNA viruses that infect insect hosts, often leading 
to developmental problems and the death of the host (4). How- 
ever, little is known about the viral (and bacterial) assemblages 
infecting Heliconius butterflies. 

We sequenced the mRNA of one male and one female of H. 
erato, obtained from a Costa Rican butterfly farm, Suministros 
Entomologicos Costarricenses, S.A. RNA was extracted from two 
individuals for each of four tissue types (the antenna, mouth- 
parts, head, and legs) using TRIzol (Life Technologies). RNA se- 
quencing libraries were prepared using the mRNA-Seq sample 
preparation kit (Alumina) and sequenced using an Illumina HiSeq 
2000 sequencer by the UCI Genomics High-Throughput Facility. 
In total, eight libraries were sequenced, producing an average li- 
brary size of approximately 24 million 100-bp paired-end reads, 
after quality control filters were applied. Paired-end libraries were 
then de novo assembled into putative mRNA transcripts using 
Trinity (5). The Trinity-assembled nucleotide (nt) contigs were 
translated into amino acid (aa) sequences, and a BLASTp compar- 
ison to the NCBI NR protein database (6) revealed the presence of 
a viral polyprotein. 

The best BLASTp hit for the discovered viral polyprotein 
matched a strain of iflavirus previously described from the Chi- 
nese oak silkmoth, Antheraea pernyi. The H. erato iflavirus poly- 
protein amino acid sequence demonstrated a 63% pairwise simi- 
larity to the Chinese oak moth iflavirus polyprotein. The coding 
DNA sequence (CDS) of the H. erato iflavirus was 8,895 nucleo- 
tides in length, encoding a 2,965-aa polyprotein. The entire iflavi- 
rus genome was 9,910 nucleotides long, which included a 906-nt 
5' untranslated region (UTR) and 110-nt 3' UTR, and had a G+C 
content of 35.3%. 

Iflavirus genomes typically encode a single polyprotein that is 



comprised of conserved domains for structural proteins in the 
N-terminal region, and non-structural proteins, RNA helicase, 
protease, and RNA-dependent RNA polymerase (RdRp) proteins 
in the C-terminal region (4). The 2,965-aa polyprotein of the 

H. erato iflavirus strain contained an arrangement of conserved 
domains common to that of previously described insect iflavirus 
genomes (7). This included two rhv-like picornavirus capsid pro- 
tein domain-like motifs (Pfam entry, cd00205), an RdRp motif 
(Pfam entry, cd01699), RNA helicase (Pfam entry, PF00910), a 
cricket paralysis virus (CRPV) capsid protein-like motif (Pfam 
entry, PF08762), and a 3-C cysteine protease motif (Pfam entry, 
PF00548). 

We present the complete genome of a novel strain of iflavirus 
isolated from H. erato. This H. erato iflavirus genomic sequence 
encodes a full and complete polyprotein when compared to other 
iflavirus genomes. The increasing availability of mRNA sequenc- 
ing data from a wide range of insect species promises to aid in the 
understanding of the ecology and evolution of insect viral infec- 
tion. 

Nucleotide sequence accession number. The genome se- 
quence was deposited at DDBJ/EMBL/GenBank under accession 
no. KJ679438. 

ACKNOWLEDGMENTS 

Thanks to Furong Yuan for providing advice on the preparation of RNA- 
Seq libraries. 

This material was based in part upon work supported by the National 
Science Foundation under cooperative agreement no. DBI-0939454 and 
grant IOS-1257627 to A.D.B. 

Any opinions, findings, and conclusions or recommendations ex- 
pressed in this material are those of the authors and do not necessarily 
reflect the views of the National Science Foundation. 

REFERENCES 

I. liggins CD, Naisbit RE, Coe RL, Mallet J. 2001. Reproductive isolation 
caused by colour pattern mimicry. Nature 41 1:302-305. http://dx.doi.org/ 
10.1038/35077075. 

2. Mallet J, Joron M. 1999. Evolution of diversity in warning color 
and mimicry: Polymorphisms, shifting balance, and speciation. Annu. 



May/June 2014 Volume 2 Issue 3 e00398-14 



Genome Announcements 



genomea.asm.org 1 



Smith et al. 



Rev. Ecol. Syst. 30:201-233. http://dx.doi.org/10.1146/annurev.ecolsys 
.30.1.201. 

3. Supple M, Papa R, Counterman B, McMillan WO. 2014. The genom- 
ics of an adaptive radiation — insights across the Heliconius speciation 
continuum, p 249-271. In Landry CR, Aubin-Horth N (ed), Advances 
in experimental medicine and biology, vol 781. Ecological genomics: 
ecology and the evolution of genes and genomes. Springer, Dordrecht, 
Netherlands. 

4. van Oers MM. 2010. Genomics and biology of iflaviruses, p 231-250. In 
Asgari S, Johnson K (ed), Insect virology. Caister Academic Press, Norfolk, 
United Kingdom. 

5. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, 



Adiconis X, Fan L, Raychowdhury R, Zeng Q, Chen Z, Mauceli E, 
Hacohen N, Gnirke A, Rhind N, di Palma F, Birren BW, Nusbaum C, 
Lindblad-Toh K, Friedman N, Regev A. 2011. Full-length transcriptome 
assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 
29:644-652. http://dx.doi.Org/l 0. 1 038/nbt. 1883. 

6. Johnson M, Zaretskaya I, Raytselis Y, Merezhuk Y, McGinnis S, Madden 
TL. 2008. NCBI BLAST: a better web interface. Nucleic Acids Res. 36: 
W5-W9. http://dx.doi.org/ 10. 1093/nar/gkn20 1 . 

7. Sparks ME, Gundersen-Rindal DE, Harrison RL. 2013. Complete 
genome sequence of a novel iflavirus from the transcriptome of Halyo- 
morpha halys, the brown marmorated stink bug. Genome Announc. 
I(6):e00910-13. http://dx.doi.org/10.1128/genomeA.00910-13. 



2 genomea.asm.org 



Genome Announcements 



May/June 2014 Volume 2 Issue 3 e00398-14 



